A NoC-level Support for Broadcast-based Coherence Protocols
نویسندگان
چکیده
Chip Multiprocessor Systems (CMPs) rely on a cache coherency protocol to maintain memory access coherence between cached data and main memory. The Hammer coherency protocol is appealing as it eliminates most of the space overhead when compared to a directory protocol. However, it generates much more traffic, thus stressing the NoC and having worse performance in terms of power consumption. When using a NoC with built-in broadcast support network utilization is lowered but does not solve completely the problem as acknowledgment messages are still sent from each core to the memory access requestor. In this paper we propose a small and simple control network that collects the acknowledgement messages and delivers them with a bounded and fixed latency, thus relieving the NoC from a large amount of messages. Experimental results demonstrate on a 16-tile system with the control network that execution time improves up to 17%, with an average improvement of about 7.5%. Keywords—CMPs, cache hierarchy, NoC
منابع مشابه
Adding Token Counting to Directory-Based Cache Coherence
The coherence protocol is a first-order design concern in multicore designs. Directory protocols are naturally scalable, as they place no restrictions on the interconnect and have minimal bandwidth requirements; however, this scalability comes at the cost of increased sharing latency due to indirection. In contrast, broadcast-based systems such as snooping protocols and token coherence reduce l...
متن کاملImpact of Cache Coherence Protocols on the Power Consumption of STT-RAM-Based LLC
To gain higher density and lower leakage, STT-RAM has been considered an alternative to SRAM for implementing last-level caches (LLCs). However, STT-RAM requires high write energy to program. Consequently, frequent write-backs from the upper-level caches or cache fills from the main memory will result in high LLC power. Both the broadcast and write-back traffic are affected by the cache coheren...
متن کاملOperating System Support for Flexible Coherence in Distributed Shared Memory
COMMOS1 is an operating system architecture developed to support shared persistent data objects in distributed systems. This paper describes its support for flexible coherence. The approach is based on a microkernel, typed memory objects and integrated coherence control. The coherence server is clearly separated from the external pager. This separation makes it easier to provide multiple cohere...
متن کاملMemory Architecture and Management in an NoC Platform
The memory organization and the management of the memory space is a critical part of every NoC based platform design. We propose a Data Management Engine (DME), that is a block of programmable hardware and part of every processing element. It off-loads the processing element (CPU, DSP, etc.) by managing the memory space, memory access and the communication over the on-chip network. The DME’s ma...
متن کاملEnergy-Efficient Cache Coherence Protocol for NoC-based MPSoCs
As the number of cores and functionalities integrated in embedded devices increases, the amount of memory used on these devices also increases, justifying the development of memory architectures presenting scalability, low energy consumption and low latency. To implement memory solutions, most works adopting NoC-based MPSoCs only employ basic communication services, such as send/receive, withou...
متن کامل